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The stellar initial mass function (IMF) describes the mass distribution of stars at 
CSj ■ the time of their formation and is of fundamental importance for many areas of as- 

trophysics. The IMF is reasonably well constrained in the disk of the Milky Way 1 
but we have very little direct information on the form of the IMF in other galaxies 
u: and at earlier cosmic epochs. Here we investigate the stellar mass function in el- 

liptical galaxies by measuring the strength of the Na I doublet 2 ' 3 and the Wing-Ford 
molecular FeH band 4 ' 5 in their spectra. These lines are strong in stars with masses 
< 0.3 M© and weak or absent in all other types of stars. 5 7 We unambiguously detect 
both signatures, consistent with previous studies 8 that were based on data of lower 
signal-to-noise ratio. The direct detection of the light of low mass stars implies that 
they are very abundant in elliptical galaxies, making up > 80 % of the total number 
of stars and contributing > 60 % of the total stellar mass. We infer that the IMF in 
massive star-forming galaxies in the early Universe produced many more low mass 
stars than the IMF in the Milky Way disk, and was probably slightly steeper than 
the Salpeter form 9 in the mass range 0.1 - 1 M . 

o : 

We obtained spectra of eight of the most luminous and massive galaxies in the 
| nearby Universe: four of the brightest early-type galaxies in the Virgo cluster and 

four in the Coma cluster. The galaxies were selected to have velocity dispersions 
a > 250 km s 1 , and were observed with the Low-Resolution Imaging Spectrometer 10 
(LRIS) on the Keck I telescope. In 2009 the red arm of LRIS was outfitted with fully- 
depleted LBNL CCDs, which have excellent sensitivity out to A > 9000 A and virtually 
no fringing. The individual spectra of the four galaxies in each of the two clusters were 
de-redshifted, averaged, and binned to a resolution of 8 A. 

In Fig. lb,c we show the spectral region near the AA8183, 8195 Na I doublet for the 
Virgo and Coma galaxies. The doublet appears as a single absorption feature due to 
Doppler broadening. In Fig. le,f we show the region around the A9916 Wing-Ford 
band for the Virgo galaxies. This region could not be observed with sufficient signal in 
Coma as it is redshifted to 1.015 /im. The spectra are of very high quality. The median 
la scatter of the four galaxies around the average spectrum is only ~ 0.3 % per spectral 
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bin. The median absolute difference between the Virgo and Coma spectra is 0.4 % per 
spectral bin. 

Both the Na I doublet and the Wing-Ford band are unambiguously detected. The 
central wavelength of the observed Nal line coincides with the weighted average 
wavelength of the doublet and the observed Wing-Ford band has the characteristic 
asymmetric profile reflecting the A A A — X A A transition of FeH. 5 The Nal index is 
0.058 ± 0.006 mag in the Virgo galaxies and 0.057 ± 0.007 mag in Coma. The Wing- 
Ford index in Virgo galaxies is 0.027 ± 0.005. The uncertainties are determined from 
the scatter among the individual galaxies. Note that any residual systematic problems 
with the detector or atmosphere are incorporated in this scatter, as the features were 
originally redshifted to a different observed wavelength range for each of the galaxies. 

The immediate implication is that stars with masses < 0.3 M Q are present in sub- 
stantial numbers in the central regions of elliptical galaxies. Such low mass stars are 
impossible to detect individually in external galaxies, as they are too faint: Barnard's 
star would have a K band magnitude of 39 at the distance of the Virgo cluster. This in 
turn implies that there was a channel for forming low mass stars in the progenitors of 
luminous early-type galaxies in clusters. These star-forming progenitors are thought 
to be relatively compact galaxies at z = 2 — 5 with star formation rates of 10s or 100s 
of Solar masses per year. Some studies have suggested truncated IMFs for such galax- 
ies, 11 with a cutoff below 1 M . Such dwarfless IMFs are effectively ruled out by the 
detection of the Na I line and the Wing-Ford band. 

We turn to stellar population synthesis models to quantify the number of low mass 
stars in elliptical galaxies. As discussed in detail in the Supplementary Information, we 
use a flexible stellar population synthesis code 12 combined with an extensive empirical 
library of stellar near-infrared spectra. 13 In Fig. 1 we show synthetic spectra in both 
spectral regions for different choices of the IMF, 1,914 including IMFs that are steeper 
than the Salpeter form. The fits are excellent, with differences between data and the 
best-fitting model < 0.5% over the entire spectral range. Predicted line indices are 
compared to the observed values in Fig. 2. The data prefer IMFs with substantial dwarf 
populations. The best fits are obtained for a logarithmic IMF slope of x ~ — 3, a more 
dwarf-rich ("bottom-heavy") IMF than even the Salpeter form, which has x = —2.35. 
A Kroupa IMF (which is appropriate for the Milky Way) is inconsistent with the Wing- 
Ford data at > 2a and inconsistent with the Na I data at > 4<r, as are IMFs with even 
more suppressed dwarf populations. 1416 We note that the x = — 3 IMF also provides a 
much better fit to the region around 0.845/im than any of the other forms. Taking the 
Salpeter IMF as a limiting case, we find that stars with masses of 0.1 M Q - 0.3 M make 
up at least 80 % of the total number of living stars in elliptical galaxies, and contribute 
at least 60 % of the total stellar mass. 

Although the formal uncertainty in the derived IMF slope is small we stress that 
some unknown systematic effect could be present in the stellar population synthesis 
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modeling. In particular, weak features in the spectra of giant stars in elliptical galaxies 
may be incorrectly represented by the Milky Way giants that we use. It may also be 
that the Na abundance of low mass stars in elliptical galaxies is different from that of 
low mass mass stars in the Milky Way. The fact that all models fit the spectra of both 
the Virgo and Coma galaxies extremely well outside of the IMF-sensitive regions gives 
some confidence in our approach, and as we show in the Supplementary Information 
the quality of the fit constrains possible contamination of particular elements such as 
TiO. 

Besides model uncertainties, the interpretation may be complicated by the fact that 
we are constraining the IMF some 10 billion years after the stars were formed. It is now 
generally thought that elliptical galaxies have undergone several (or many) mergers 
with other galaxies after their initial collapse, 17 which may imply that the stellar pop- 
ulation is more complex than our single age, single metallicity model. On the other 
hand, these accretion events probably mostly add stars at large radii 18 and may not 
have affected the core regions very much. It will be interesting to search for gradients 
in dwarf-sensitive features with more extensive data. 19 

Our results are consistent with previous studies of the near-infrared spectra of el- 
liptical galaxies. 8 ' 20 They are also consistent with recent dynamical and lensing con- 
straints on the IMF in elliptical galaxies with large velocity dispersions 21 and directly 
identify the stars that are responsible for their high masses: the dynamical data can- 
not distinguish dwarf-rich IMFs from dwarf-deficient IMFs as the latter have a large 
amount of mass in stellar remnants. 14 A steep IMF for elliptical galaxies is also quali- 
tatively consistent with the apparently higher number of low mass stars in the Milky 
Way bulge than in the disk. 22 Our best-fitting IMF does not appear to be consistent with 
the observed color and M/L evolution of massive cluster galaxies, 14 which suggest an 
IMF with a slope x ~ around ~ 1M . Interpreting the evolution of the colors and 
luminosities of elliptical galaxies relies on the assumption that these galaxies evolve in 
a self-similar way, which may not be valid. 18 ' 23 It could also be that the form of the IMF 
is more complex than a powerlaw. 

Our results also seem inconsistent with theoretical arguments for dwarf-deficient 
IMFs at high redshift, which have centered on the idea that the characteristic mass 
of stars scales with the Jeans mass in molecular clouds. 24 ' 25 The Jeans mass has a 
strong temperature dependence and it has been argued that relatively high ambient 
temperatures in high-redshift star forming galaxies may have set a floor to the char- 
acteristic mass in the progenitors of elliptical galaxies. 2414 However, the Jeans mass 
also scales with density, and the gas densities in the star-forming progenitors of the 
cores of elliptical galaxies were almost certainly significantly higher than typical den- 
sities of star forming regions in the Milky Way. Numerical simulations suggest that 
the formation of low mass stars becomes inevitable if sufficiently high densities are 
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reached on sub-parsec scales. 26 Furthermore, recent semi-analyic models of the ther- 
mal evolution of gas clouds have emphasized the effects of dust-induced cooling, 27 
which is relatively insensitive to the ambient temperature and particularly effective at 
high densities. Time-scale arguments suggest that the physical conditions expected in 
starburst galaxies at high redshift might even enhance low-mass star formation, rather 
than suppress it. 28 

Taken at face value, our results imply that the form of the IMF is not universal 
but depends on the prevailing physical conditions: Kroupa-like in quiet, star-forming 
disks and dwarf-rich in the progenitors of massive elliptical galaxies. This informs 
models of star formation and has important implications for the interpretation of ob- 
servations of galaxies in the early Universe. The stellar masses and star formation rates 
of distant galaxies are usually estimated from their luminosities, assuming some form 
of the IMF. 29 ' 30 Our results suggest that a different form should be used for different 
galaxies, greatly complicating the analysis. The bottom-heavy IMF advocated here 
may also require a relatively low fraction of dark matter within the central regions of 
nearby massive galaxies. 21 
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Figure 1 | Detection of the Nal doublet and the Wing-Ford band, a, Spectra in the vicinity 
of the AA8183, 8195 Na I doublet for three stars from the IRTF library: 12 a KO giant, which dom- 
inates the light of old stellar populations; an M6 dwarf, whose (small) contribution to the inte- 
grated light is sensitive to the form of the IMF at low masses; and an M3 giant, which has po- 
tentially contaminating TiO spectral features in this wavelength range, b, Averaged Keck/LRIS 
spectra of NGC 4261, NGC 4374, NGC 4472, and NGC 4649 in the Virgo cluster (black line) and 
NGC4840, NGC 4926, IC3976, and NGC 4889 in the Coma cluster (grey line). Four exposures 
of 180 s were obtained for each galaxy. The one-dimensional spectra were extracted from the 
reduced two-dimensional data by summing the central 4", which corresponds to ps 0.4 kpc at 
the distance of Virgo and « 1.8 kpc at the distance of Coma. We found little or no dependence 
of the results on the choice of aperture. Colored lines show stellar population synthesis models 
for a dwarf-deficient "bottom-light" IMF, 14 a dwarf-rich "bottom-heavy" IMF with x = —3, 
and an even more dwarf-rich IMF. The models are for an age of 10 Gyr and were smoothed to 
the average velocity dispersion of the galaxies. The x = — 3 IMF fits the spectrum remarkably 
well, c, Spectra and models around the dwarf-sensitive Na I doublet. A Kroupa IMF, which 
is appropriate for the Milky Way, does not produce a sufficient number of low mass stars to 
explain the strength of the absorption. An IMF steeper than Salpeter appears to be needed. 
d,e,f, Spectra and models near the A9916 Wing-Ford band. The observed Wing-Ford band also 
favors an IMF that is more abundant in low mass stars than the Salpeter IMF. All spectra and 
models were normalized by fitting low order polynomials (excluding the feature of interest). 
The polynomials were quadratic in a,b,d,e and linear in c,f . 

Figure 2 | Constraining the IMF. a, Various stellar IMFs, ranging from a "bottom-light" IMF 
with strongly suppressed dwarf formation 14 (light blue) to an extremely "bottom-heavy" IMF 
with a slope x = -3.5. The IMFs are normalized at 1 M©, as stars of approximately that mass 
dominate the light of elliptical galaxies, b, Comparison of predicted line Nal and Wing-Ford 
indices to the observed values. The indices were defined analogous to refs. 4 and 8. The Nal 
index has central wavelength 0.8195 \i and side bands at 0.816 /im and 0.825 /mi. The Wing- 
Ford index has central wavelength 0.992 /im and side bands at 0.985 |im and 0.998 fim. The 
central bands and side bands are all 20 A wide. Both observed line indices are much stronger 
than expected for a Kroupa IMF. The best fits are obtained for IMFs that are slightly steeper 
than Salpeter. 
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Supplementary Information: Stellar population synthesis modeling 

The stellar population model used in the present analysis was constructed specifically for this 
project. We therefore describe it in some detail below. 

Construction of the model: 

The IRTF spectral library 1 ' 2 constitutes the core of our model. This spectral library is novel in 
its near-IR coverage of a large sample of cool stars at a resolving power of R ~ 2000. 65 stars 
were selected from this library with luminosity classes III and V (giants and dwarfs). Pecu- 
liar stars were removed, including stars with non-solar Fe abundance and /or other peculiar 
abundance patterns (mainly Ca, CN, CH, and Ba), as well as known pulsating M giants. Stars 
with significant reddening were also removed. The IRTF library includes 2MASS JHK and 
literature V band photometry for each star. 

We estimated effective temperatures for these stars based on empirical V — K vs. T c g re- 
lations that have been separately determined for giants 3 5 and dwarfs. 6 ' 7 The adopted dwarf 
relations agree to within 100 K with a recently updated temperature scale. 8 ' 9 

The IRTF library is absolute-flux calibrated against 2MASS photometry. We adopt paral- 
lax values obtained from SIMBAD in order to derive absolute luminosities. In order to derive 
bolometric luminosities, which we will use to assign stellar masses, we must extrapolate the 
IRTF library beyond the observed range. We perform the extrapolation by utilizing the latest 
version of the PHOENIX model spectra. The model spectra are normalized to the IRTF library 
at the blue and red ends of the observed wavelength range so that the extrapolation is contin- 
uous across the transition region. The model spectrum used to extrapolate each IRTF stellar 
spectrum was chosen from the estimated T e g for each IRTF star and an estimate of the surface 
gravity of the star based on its luminosity class. We note that the extrapolation does not lead 
to large uncertainties as the IRTF wavelength range (0.8 — 2.5 fim) encompasses the majority of 
the bolometric flux, especially for K and M stars. As an example, 80 % of the flux from M-type 
stars is emitted within the IRTF spectral range. 

The resulting T e g and Lbol for each IRTF star is plotted in Supp. Fig. 1. Also shown in this 
Figure are model isochrones (described below). The agreement between the data and models 
is remarkable, and provides a valuable consistency check on both the estimated T e g and Lboi 
values. 

We adopt a combination of isochrones in order to utilize the most accurate stellar models 
for each range in stellar mass. At the lowest masses we adopt the Baraffe stellar evolution 
calculations, 10 which are unique in their use of realistic atmospheric boundary conditions. Be- 
tween 0.2 M Q and the tip of the RGB we use the Dartmouth evolutionary calculations, 11 which 
have been shown to produce accurate fits to photometry of open and globular clusters of a 
variety of ages and metallicities. 12 The Padova evolutionary tracks 13 are used to extend the 
isochrones through the horizontal branch and asymptoptic giant branch evolutionary phases. 
This combined isochrone set is shown in Supp. Fig. 1. 

The isochrones are used to assign masses to each IRTF spectrum, based on the measured 
Lboi for each star. We note that the stellar T c r values are used only to choose the appropri- 
ate PHOENIX model spectra to extrapolate the observed spectra. Our conclusions remain 
unchanged if we instead assign masses based on the estimated T cff for each star. Alpha- 
enhancements of 0.2 dex have a very minor effect on the L bo i - M and T cS - M relations, 
when [Fe/H] is held constant. As a final test of our procedure we compare the masses to those 
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predicted from empirical relations between mass and K— band luminosity derived for stars 
in binary systems. 14,15 Such relations are completely independent of stellar evolution calcula- 
tions because the masses are derived from dynamical modeling. The stellar mass - K— band 
luminosity relation for our IRTF stars agrees very well with the relations based on dynamical 
masses in the range 0.08 < M < 0.6 M Q . The agreement is particularly strong when bolometric 
luminosities are used to assign masses, which motivates the use of bolometric luminosities as 
our default method. Based on these comparisons, we conclude that our mass estimates are 
reliable. 

Once masses are assigned, a synthesized spectrum is created by integrating the observed 
spectra over an initial mass function. 

Metallicity and a-enhancement: 

The empirical spectral library that we use has important advantages over libraries of model 
atmospheres. Model atmospheres do not reproduce molecular features such as TiO very well, 
which makes it difficult to interpret fits of these models to our spectra. We note here that Kurucz 
models — which are used in most existing stellar population synthesis models at A > 8000 A 
— also prefer IMFs that are more "bottom-heavy" than the Kroupa IMF, but give much poorer 
fits than the IRTF empirical library to the spectral regions away from Na I and the Wing-Ford 
band. 

A disadvantage of the empirical library is that it is not straightforward to assess the ef- 
fects of abundance variations. We know that the iron abundance in the eight Coma and Virgo 
ellipticals is approximately Solar, 16,17 and so the value of [Fe/H] of the Milky Way stars is 
appropriate for the galaxies that we study here. However, the abundance of a-elements (and 
the overall mass-weighted metallicity [Z/H]) is higher in massive elliptical galaxies than in the 
Milky Way stars. Although sodium (responsible for the Na I line) and iron (responsible for the 
Wing-Ford band) are themselves not a-elements, both lines and their sidebands overlap with 
TiO lines that are present in M giants. 

We empirically determined to what extent enhanced TiO absorption could influence the 
results by creating models with artifically enhanced late M giant light. The model shown in 
Supp. Fig. 2 has a Kroupa IMF and an additional contribution of late M giants amounting to 
40 % of the light at 0.9 fim. This is an extreme model as these stars should contribute only 
a few percent to the light, but we use it here to assess the effects of enhancing TiO features. 
This model fits the observed strength of the Wing-Ford band quite well as it coincides with the 
5(2 — 3) band of TiO. However, it also predicts that other TiO features are nearly as strong as 
the absorption at 0.99 /im, which is clearly not the case. Moreover, the fit to the Nal line is not 
improved as the only relevant TiO feature is slightly redward of Na I, and the fit to the rest 
of the spectrum in the wavelength range 0.815 — 0.860 |im is unacceptable. We note that the 
apparently limited enhancement of TiO with respect to the Milky Way giants is consistent with 
the weak response of TiO to either metallicity or a-enhancement in the PHOENIX library and 
the models of ref. 18. 

We infer that the Wing-Ford band should be interpreted with caution but that the combina- 
tion of the Wing-Ford band with TiO features at other wavelengths and (particularly) the Na I 
line should be a robust indicator of the presence of low mass stars. This is fully consistent with 
previous theoretical work. 19 
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Supplementary Figure 1 | The IRTF stellar library. Hertzprung-Russell diagram of IRTF 
stars and theoretical stellar evolution calculations. IRTF stellar L bol and T ef j have been esti- 
mated as described in the text. Theoretical models assume [Fe/H] = 0.0 and an age of 13.7 Gyr. 
The models encompass all phases of stellar evolution from main sequence hydrogen burning 
through the end of the asymptotic giant branch, and extend down to the hydrogen burning 
limit of 0.08 M . 

Supplementary Figure 2 | Effect of enhancing M giant features, a Spectra and models 
around the Nal doublet. Our favored model with a steep IMF (x = —3.0) is compared to a 
model that assumes a Kroupa IMF with the addition of a 40% light contribution from an M6 
giant star. Since late M giants have very strong TiO features this exercise is meant to mimic 
the effects of increasing a-enhancement in the models. Clearly the addition of a substantial 
amount of M giant light provides a poor fit to the absorption line at 0.82 as the central 
wavelength is not matched. This demonstrates that this feature is due to Na I and not TiO. The 
fit is also very poor to the rest of the spectrum in this wavelength range, b Spectra and models 
around the Wing-Ford band. Here the addition of M giant light adequately reproduces the 
strong observed Wing-Ford absorption due to the TiO absorption feature that coincides with 
the FeH absorption. However, this fit also predicts unacceptably large absorption at 0.983 /im, 
1.001 fim, and 1.006 /im as these TiO features are nearly as strong as the one at 0.992 /im. 
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